A bark coherence function for perceived speech quality estimation
Authors
Abstract
A new methodology for perceptual speech quality measurement is presented. The method defines the bark coherence function (BCF) as a new cognition module. Previously developed perceptual measures occasionally produce false predictions when they are applied to the end-to-end speech quality measurement of communication systems. These errors are mainly caused by linear distortion in the analogue interface of the system under evaluation. The BCF itself normalizes the effects of linear filtering, which makes it well suited to the speech quality assessment of mobile communication systems. In addition, the proposed scheme requires neither local nor global scaling, so it is robust to differences between the original and received speech levels. To evaluate the performance of the new perceptual model, regression analysis was performed with CDMA digital cellular, CDMA personal communication service (PCS) and speech codecs. The correlation coefficients computed using the BCF showed noticeable improvements over the PSQM recommended by ITU-T. The robustness of the BCF to various conditions was also tested.
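The claim that a coherence-based measure normalizes linear filtering and level differences can be illustrated with the ordinary magnitude-squared coherence. The sketch below is not the BCF described in the abstract: it omits the Bark-band grouping and loudness mapping, uses white noise as a stand-in for speech, and every name in it (`msc`, `clean`, `filtered`, `noisy`) is a hypothetical choice made for illustration. It only shows that coherence stays near 1 under a linear filter plus gain change, while uncorrelated additive noise lowers it.

```python
import numpy as np

def msc(x, y, nfft=256, hop=128):
    """Welch-style magnitude-squared coherence between x and y (illustrative helper)."""
    win = np.hanning(nfft)
    n_frames = 1 + (min(len(x), len(y)) - nfft) // hop
    sxx = np.zeros(nfft // 2 + 1)
    syy = np.zeros(nfft // 2 + 1)
    sxy = np.zeros(nfft // 2 + 1, dtype=complex)
    for i in range(n_frames):
        seg = slice(i * hop, i * hop + nfft)
        X = np.fft.rfft(win * x[seg])
        Y = np.fft.rfft(win * y[seg])
        sxx += np.abs(X) ** 2          # auto-spectrum of x, summed over frames
        syy += np.abs(Y) ** 2          # auto-spectrum of y, summed over frames
        sxy += np.conj(X) * Y          # cross-spectrum, summed over frames
    # Small constant guards against division by zero in empty bins
    return np.abs(sxy) ** 2 / (sxx * syy + 1e-12)

rng = np.random.default_rng(0)
clean = rng.standard_normal(16000)     # synthetic stand-in for a speech signal (assumption)

# Linear distortion: a short FIR filter combined with a level change
fir = np.array([1.0, 0.5, 0.25])
filtered = 0.3 * np.convolve(clean, fir)[:len(clean)]

# Non-linear-type degradation: independent additive noise at equal power
noisy = clean + rng.standard_normal(16000)

coh_linear = msc(clean, filtered).mean()
coh_noisy = msc(clean, noisy).mean()
print(f"mean coherence, linear filter + gain: {coh_linear:.3f}")
print(f"mean coherence, additive noise:       {coh_noisy:.3f}")
```

Because the filter response and the gain appear in both the numerator and the denominator of the coherence ratio, they cancel, so no separate level alignment step is needed in this toy setting.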
Related resources
Speech quality measure for voIP using wavelet based bark coherence function
The Bark Coherence Function (BCF) [1] defines a coherence function with loudness speech as a new cognition module, robust to linear distortions due to the analog interface of digital mobile system. Preliminary experiments have shown the superiority of BCF over current measures. In this paper, a new BCF suitable for VoIP is developed. The new BCF is based on the wavelet series expansion that pro...
Dual channel speech enhancement using coherence function and MDL-based subspace approach in bark domain
A novel algorithm for dual channel speech enhancement is presented. It combines the coherence function and a subspace approach in the Bark domain together with an optimal subspace selection through the minimum description length (MDL) criterion. The coherence function allows one to exploit the spatial diversity of the sound field. The processing in the Bark domain permits to take into account o...
Objective estimation of perceived speech quality. I. Development of the measuring normalizing block technique
Perceived speech quality is most directly measured by subjective listening tests. These tests are often slow and expensive, and numerous attempts have been made to supplement them with objective estimators of perceived speech quality. These attempts have found limited success, primarily in analog and higher-rate, error-free digital environments where speech waveforms are preserved or nearly pre...
Text-to-speech voice adaptation from sparse training data
Voice adaptation describes the process of converting the output of a text-to-speech synthesizer voice to sound like a different voice after a training process in which only a small amount of the desired target speaker’s speech is seen. We employ a locally linear conversion function based on Gaussian mixture models to map bark-scaled line spectral frequencies. We compare performance for three di...
Real-Time, Non-intrusive Evaluation of VoIP
Speech quality, as perceived by the users of Voice over Internet Protocol (VoIP) telephony, is critically important to the uptake of this service. VoIP quality can be degraded by network layer problems (delay, jitter, packet loss). This paper presents a method for real-time, non-intrusive speech quality estimation for VoIP that emulates the subjective listening quality measures based on Mean Op...
Publication date: 2000